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Abstract 

This work derives a distributed and iterative algorithm by which mobile terminals can selfishly 
control their transmit powers during the synchronization procedure specified by the IEEE 802.16m and 
^ , the 3GPP-LTE standards for orthogonal frequency-division multiple-access technologies. The proposed 

solution aims at maximizing the energy efficiency of the network and is derived on the basis of a finite 
noncooperative game in which the players have discrete action sets of transmit powers. The set of Nash 
equilibria of the game is investigated, and a distributed power control algorithm is proposed to achieve 
synchronization in an energy-efficient manner under the assumption that the feedback from the base 
station is limited. Numerical results show that the proposed solution improves the energy efficiency as 
well as the timing estimation accuracy of the network compared to existing alternatives, while requiring 
a reasonable amount of information to be exchanged on the return channel. 
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I. Introduction 

The issue of energy efficiency has attracted a considerable interest in the information and 
telecommunication technology community during the last decade, as witnessed by the extensive 
literature available on this subject (see for example [1] and references therein). Among others, 
a challenge that lies in this paradigm is to prolong battery life of mobile terminals based on 
orthogonal frequency-division multiple-access (OFDMA) technologies such as those operating 
according to the IEEE 802.16m and the 3GPP long term evolution (LTE) standards. 
The first operation that must be accomplished by any terminal when joining the network is 
achieving correct synchronization with its serving base station (BS). This procedure is called 
initial ranging in IEEE 802.16m Q, and random access in LTE J3]|. It relies on a contention- 
based approach taking place over a specified set of subcarriers, which are used by each terminal 
to notify its entry request by transmitting a packet consisting of a randomly chosen code. 
Code identification as well as multiuser timing estimation are the main tasks of the BS during 
this procedure. These problems have received significant attention in the past few years, and 
some solutions are currently available in the literature (see for example [@]-[|7) and references 
therein). All the aforementioned works assume a deterministic increase of the transmit power 
upon successful synchronization without taking into account any energy efficiency issue. This is 
motivated by the fact that the energy efficiency problem in OFDMA-based technologies has been 
mainly analyzed for the data transmission phase (e.g., see [O and flU and references therein). 
A first attempt to reduce the power consumption during the initial synchronization phase can 
be found in ifTOl . in which a low-complexity and iterative algorithm is proposed to allow each 
synchronization terminal (ST) and the BS to locally choose the transmit power and the detection 
strategy, respectively. The goal is to obtain a good tradeoff between detection capabilities and 
power consumption while satisfying quality-of-service (QoS) requirements given in terms of 
timing estimation error and probability of false code lock. The proposed solution is based on 
a noncooperative game-theoretic formulation and it is shown to provide significant gains in 
terms of reduced synchronization time and parameter estimation accuracy compared to existing 
alternatives based on a deterministic increase of the transmit power. Although interesting from a 
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theoretical point of view, the analysis provided in [|T0l is not suited for practical applications since 
it relies on the assumption of a continuous set of transmit powers. Moreover, comparisons with 
existing alternatives are carried out assuming that STs have perfect knowledge of the signal- 
to-interference-plus-noise ratio (SINR) measured at the BS. A similar game-theoretic line of 
reasoning has been recently used for achieving synchronization in code-division multiple-access 
networks operating in a flat-fading scenario [fTT| . and in a frequency- selective one H3. 

Motivated by the above considerations, in this work we return to the problem discussed in 
ifTOl and extend both the power allocation approach and the numerical analysis as follows. We 
first assume that a finite set of transmit powers is available at each terminal. Compared to 
IfTOl . this more application-oriented assumption changes completely the nature of the energy- 
efficient optimization problem, as the tool of finite noncooperative game theory is used to find its 
solution lfl"3l . The set of Nash equilibria of the game is investigated and compared to that of the 
continuous-power noncooperative game discussed in [fTOl . The theoretical analysis of the finite 
game is adopted to derive an iterative and distributed power allocation algorithm for achieving 
synchronization under the assumption of a limited feedback from the BS. Numerical results 
are used to compare the performance of the proposed solution with that achieved by existing 
alternatives based on a deterministic increase of the transmit power (with and without contention 
resolution methods). It turns out that the proposed solution provides benefits in terms of energy 
efficiency and parameter estimation accuracy, using a reasonable amount of feedback resources. 

The remainder of this paper is structured as follows!] Section |n] describes the system model 
and introduces the problem. Section III formulates the game and investigates its equilibria. The 
analysis is used in Section IV to derive an iterative and distributed synchronization algorithm 
whose performance assessment is provided in Section V. Finally, Section [VI] concludes the paper 
and discusses the applicability of this technique to current wireless standards. 

'The following notation is used throughout the paper. Matrices and vectors are denoted by boldface letters. I n , n , and l n are 
the n x n identity matrix, the nxl all-zero vector, and the nxl all-one vector, respectively, whereas A = diag{a(n) ; n = 
1,2, ...,iV} denotes an N x N diagonal matrix with entries a(n) along its main diagonal. We use E{-}, (-) T and (-) H 
for expectation, transposition and Hermitian transposition, respectively, ||-|| for the Euclidean norm of the enclosed vector, 
[x\ to round x to the nearest integer towards zero, [x~\ to round x to the nearest integer, x\sa = 101og 10 a-, and finally 
[x] a = max(o, min(a:, 6)). 
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II. System model and problem formulation 

A. System model 

We consider the uplink of an OFDMA-based system employing N subcarriers with index set 
{0, 1, . . . , N — 1}. To avoid aliasing problems, 2N V null subcarriers are placed at the spectrum 
edges. The remaining N — 2N V subcarriers are grouped into synchronization subcarriers and data 
subcarriers. The former are used by the STs entering the network through a contention-based 
synchronization procedure, while the latter are assigned to mobile terminals for data transmission 
and channel estimation. We denote by K the number of STs, and assume that the synchronization 
subcarriers are divided into M subbands, each composed of a set of V adjacent subcarriers, which 
is called a tile. We denote by c k = [c fc (0), . . . , c k (MV — 1)] the code chosen by the kth ST, and 
call 9 k the timing offset of the A;th ST (normalized to the sampling period T s ). 

As in [[Toll , we consider a quasi-synchronous system in which no interblock interference (IBI) 
is present at the BS receiver, and we neglect any residual carrier frequency offset J^| Moreover, 
we assume that the channel frequency response is nearly flat over each tile, and users other than 
those performing synchronization have been successfully synchronized to the BS so that they do 
not generate significant interference. Under the above assumptions, the vector X(m) containing 
the mth-tile discrete Fourier transform (DFT) outputs at the BS can be written as 

K 

X(m) = VPk~C k (m)*(6 k )H k (m) + n(m) (1) 
fc=i 

where p k denotes the transmit power of the A;th ST, C k (m) = diag{cfc(m^), . . . , c k (mV + V — 
1)}, the vector &(9 k ) is given by 

a(0 fc ) = [1, e~ j2 ^ N , e -^(v-i)e k /N ]T (2) 

and n(m) is additive white Gaussian noise (AWGN) with zero mean and covariance matrix 
a 2 n l v . 

As mentioned above, the main tasks of the BS during the synchronization procedure are code 
detection and timing offset estimation. Following [TTOl . the kth. code c k is declared as detected 

2 This assumption is reasonable as long as downlink estimation errors are within a few percents of the subcarrier spacing and 
low mobility applications are considered (2). 
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if the following generalized likelihood ratio test (GLRT) is satisfied: 

A/ A /^ fc) > A (3) 

where the threshold A is a design parameter chosen so as to achieve a desired probability of 
false alarm il fa , and A(9 k ) is given by 

M-l 



m4) = i E 



m=0 



^(^)Cf(m)XK 



(4) 



In the above equation, is the maximum likelihood estimate of 9 k , given by IfTOl 



9 k = arg max_A(0 fc ) (5) 

0<9 k <6 

with 9 being the (normalized) round trip propagation delay for a user located at the cell boundary 

m. 

5. Problem formulation 

We let p = [pi,p2, • • • ,Px] T , and we define 7fc as the SINR of the kth ST, given by ifTOll 

Ik = Vk{v\k)Pk (6) 
where p\ k = p \ p k = [p u . . .,Pk-i,Pk+i, ■ ■ -,Pk} T , and v k (p\ k ) is defined as 

ffc(P\fc) = 2 | ^ (7) 

with ae = t/MJ2m=o \Hi{m)\ 2 being the ST ts average channel power gain across tiles. 

Following IfTOll , the energy-efficient optimization problem can be mathematically formalized 
for all STs k 6 K = {1, . . . , K} as 

* n dfc (7 fc ) 

P fc = arg max — - (8) 

p k €V k p k T 



subject to: MSE(0 fc ) < MSE e (9) 

where V k denotes the set of transmit powers and T is the duration of the cyclically extended 
OFDMA block, whereas MSE(0 fc ) = E{\9 k -9 k \ 2 } is the mean-square error (MSE) of the timing 



estimate 9 k , and MSEg is the network QoS requirement in terms of maximum timing estimation 
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MSE. In addition, U^kilk) represents the probability of correct detection of code c k given by 

ma 

IV (7*) = / (i +7fc)( i-A) [M(V - 1), M] (10) 

1 + (1-A) 7fe 

where I x [-, ■} is the incomplete beta function [15J. Unlike ifTOl . we assume that the power strategy 
set Vk is finite and given by 

* = {»f ,»»... ,»«*>} (id 

where the number of power levels Q k is computed as 

Qt = 1 + P k u-P k u (12) 

^fc|dB 

with A/,. > 1 being the quantization step, and p k and p k the minimum and maximum power levels, 
respectively. From (fTTT) and (Jl2j> , it follows that irjp = p , ^ = p k , and 7rJ^ = p k ■ A£ _1 . 

Setting V k as specified in ([111) allows us to meet the technical requirements of practical 
systems in which transmit powers are usually equally spaced on a logarithmic scale to reduce 
the complexity of the front-end architecture and to increase the efficiency of power amplifiers 
(see for example the specifications provided by the IEEE 802.16m and 3GPP LTE standards 
in [|2l and 0, respectively). For simplicity, in all subsequent derivations we assume A k = A, 
p k = p and p k =p for all k E JC. This also implies Q k = Q. 

As mentioned previously, the aim of this work is to solve d8) taking into account the discrete 
nature of the power strategy sets {V k }. In the sequel, this is achieved by resorting to the analytical 
tools of finite game theory lfT3l . 

III. Game formulation and analysis 
Using the results illustrated in ffTOl . it follows that the MSE in © can be met provided that 

Vk > < 13 > 

MP\k) 

where 7 rcq denotes the minimum SINR such that the MSE constraint © is satisfied with equality, 
i.e., MSE(#k) = MSEg. In particular, 7req is found to be IfTOl 

3iV 2 1 

7req ~ 2Mvr 2 (y 2 - 1) ' p { } 
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where p is defined as p = MSE e — n 2 (6 k ) with p,(9 k ) = E,{9 k } — 6 k denoting the bias of the 
timing estimate 8 k . Using (fl"3l) . the optimization problem in ([8]) can be reformulated as 

* n d ,fc(7fc) 

Pk = ar S max 7fT~ (15) 

where 

A(p\fe) = {pk e V k : Pk > -^4) (16) 
I "fc(P\fc)J 

is the power strategy subset that allows the fcth ST to meet the MSE constraint in ©. For 
notational simplicity, in all subsequent derivations we omit the functional dependence of A k on 

a 

The power allocation problem in (fT3T ) can be formulated as a generalizedj noncooperative 
game with complete information ffT3l . denoted by Q = [K,, {A k }, {u k }}. In particular, /C = 
{1, 2, . . . , K} is the player set, A k is the action set of the A;th player, and u k is its payoff 
function given by 

/ v n difc (7 fc ) 

Ufc(p) = — (17) 

which depends on the power allocation p through 7^ as in ©. The discrete nature of A k places 
Q into the category of finite generalized noncooperative games. As mentioned previously, this is 
much different from [10], in which the continuous nature of the action sets allows us to formalize 
the optimization problem as an infinite generalized noncooperative game Q c . 

A. Analysis of the equilibria 

The existence and uniqueness of the generalized Nash equilibria (GNE) of Q are studied in 
the following. The analysis is conducted only for pure (i.e., deterministic) strategies. This choice 
is motivated by the fact that in compact strategy spaces, mixed (i.e., statistical) strategies are 
generally less attractive due to implementation difficulties in wireless communications systems 

ma. 

To proceed further, we recall that a vector p* = [p{,P2, ■ ■ ■ ,Pk] T ^ s a pure-strategy GNE of 
Q if, for any k 6 K., 

«*([Pfc,P\ fc ]) >«*([Pfc,PvJ) (18) 

3 The game is generalized since Ak depends on the other STs' power p\ fe (see for example 1131 and 1161 for more details). 
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for all transmit powers p k € A k - Another way to define a pure-strategy GNE is to make use 
of the concept of best response |[T3l . In particular, we have that a vector p* is a GNE if each 
element p\ is the best response r fc (p* fc ) to the powers p* k chosen by the other players, with 
r fc(P\fe) being the solution of the following problem: 

r k{P\k) = arg max — (19) 

x PkeA k p k l 

in which we have used (fT7l) . and we have explicitly written the functional dependence of the 
detection probability Il difc on p k and p* fc through 7^ in ©. 
Theorem 1: Let us define the SINR 7* as 

7* = max(7 rcq , 7) (20) 

with 7 being the solution of 



<9n difc ( 7 ) 



n d , fc (7) 



7=7 ^ 



Then, the game Q admits pure- strategy GNE provided that 



(21) 



i*(K - 1) < V. (22) 

The proof can be found in Appendix |A1 ■ 
Theorem 2: Let £* be the set of pure GNE for Q. Then, the cardinality of S* is such that 

\S*\ > 1. (23) 

The proof can be found in Appendix El ■ 
Theorem CD provides a sufficient condition for the set of GNE not to be empty, and Theorem |2] 
states that the GNE is not necessarily unique. This means that the uniqueness property proven 
in [fTOl for Q c no longer holds for Q. In other words, quantizing the set of actions makes the 
game Q lose the uniqueness property for the GNE. In this context, it is interesting to show the 
following result. 

Theorem 3: Among all p* G £*, the smallest component-wise GNE p^ is such that 



K 



p^ = arg max ^u fc (p*). (24) 
The proof can be found in Appendix O ■ 
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The above result states that p^ is the best GNE in terms of social welfare (joint optimization) or, 
equivalently, it is the most efficient GNE in a social sense [18). Note that this does not amount 
to saying that p^ is the socially optimum solution of ©, as noncooperative equilibria are known 
to be generally inefficient ifTB") . Improving the equilibrium efficiency is out of the scope of this 
paper and is left as a future work. 

B. Numerical analysis 

Unlike the unique GNE of Q c in [TTOl . the multiple equilibria of Q cannot be expressed in a 
closed form as a function of the network parameters because of the arg max operator in (fi~9T ). 
A numerical analysis is thus conducted to make comparisons and to evaluate the impact of the 
discretization of the action sets. To this aim, we concentrate on the optimal (in a social sense) p^ 
and resort to the exhaustive search method described in fli~9l to solve (124)) . The numerical results 
are averaged over 20, 000 independent realizations of a network with the following parameters: 
T s = 89.28 ns, N = 1024, M = 4, V = 36, TT fa = 10~ 5 , and MSE e = 324, which yield 
p = 128, A = 0.12, 7 rC q|dB = —6.19, and 7*|dB = 7|dB = 7.09 (see ifTDl for a detail discussion 
on this parameter setting). The normalized power constraints are fixed to p/cr^|dB = —20 and 
P/°"nldB = +30 for all k, whereas the ST distances d k are randomly chosen from a uniform 
distribution in [R/10, R] with R being the cell radius. The channel power gains are normalized 
to a distance R/2 and are modeled using the 6-tap ITU modified vehicular-A model ll20l with 
a path loss exponent q = 2. 

Fig. Q] reports the normalized MSE defined as NMSE(p*) = E{||p* - PaIIVIIPcII 2 } as a 
function of K for different quantization steps, where p* is the unique GNE of Q c . The maximum 
number of STs is fixed to [1 + V/l*\ = 8. Observe that the condition K < 8 is required to meet 
(l22l . In addition, it represents a necessary and sufficient condition for the existence of the unique 
GNE p* [fTOTl . As expected, NMSE(p*) decreases as A becomes smaller since the discrete action 
sets Ak in Q tend to better approximate the continuous ones in Q c . As can be seen, NMSE(p*) 
increases as K increases, meaning that the difference between ||paI| 2 an d ||p*|| 2 becomes larger 
as the number of STs increases. In particular, we see that NMSE(p*) is almost constant up to 
K = 4, whereas it rapidly increases for larger values. 

To evaluate the impact of this difference on the system performance in terms of social welfare, 
Fig. |2]reports the experimental J2k ^^(Pa) / J2k u k(pt) as a function of K in the same operating 
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conditions of Fig. [TJ As seen, the ratio J2k M fc(PA)/ Sfc M fc(P*) 1S approximately 1 for K < 
6 while it increases for larger values of K. A similar behavior (not shown for the sake of 
brevity) is observed if the user-basis ratio Uk(p\) / Mfc(Pc) is considered. From these results, it 
follows that limiting the STs to use a discrete set of strategies Ak increases the global system 
performance rather than introducing a detrimental effect. This phenomenon is known as a Braess- 
type paradox (|2T|. and it has already been observed in other different contexts (such as routing 
in [|22|. Il23l and wireless communications in [|24| . [|25l0 . Roughly speaking, the Braess-type 
paradox occurs because the average number of GNEs increases when K becomes larger. For 
example, when A|dB = 1, the numerical results indicate that the average number of GNE for 
K = {2, 3, 4, 5, 6, 7, 8} is {1.0, 1.1, 1.2, 1.3, 1.7, 2.7, 20.9}, respectively. 

IV. Energy-efficient distributed synchronization 

We now show how to exploit the GNE analysis provided so far to derive a practical power 
control algorithm for achieving synchronization in the network modeled as in Section III-AI We 
start denoting by Pk[n] the transmit power of the A;th ST at the nth iteration step. Then, we 
observe that, using the results illustrated in [|26l and [1271 . it can be easily proven that, under 
hypothesis (122)) . an iterative algorithm operating according to the best-response dynamic in (Tf9l) . 
i.e., 

r , n n d (v k {y\k[n})pk) n ~ 
Pk[n + 1 = arg max — (25) 

converges to the most socially efficient GNE if Pk[0] = p for k = 1, 2, . . . , K. Note that the 
computation of p k [n + 1] in (|25|) requires knowledge of u k (p\k[n]). Using ©, it follows that 
Vk(p\k[n]) can be obtained as 

^(P\*N) = ^{4 (26) 

where 7fe[n] is ST fc's SINR measured at the BS at time step n. While Pk[n] is locally available 
at the transmitter, 7fe[n] can only be estimated at the BS and sent to the A;th ST on a downlink 
control channel. Following [fTOll . an unbiased estimate jk[n] of 7fe[n] can be computed as 



E^ol|X(m)|| 2 -A fe ( 
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To reduce the amount of information to be exchanged, we assume the quantity %[n\ to be 
quantized on a logarithmic scale using a uniform £?-bit quantizer. This produces 

MfeNldB = A 7 |dB • b k [n] (28) 

where 



b k [n) 
and 



dB 



^7|dB 



(29) 



_ 7|dB-7|dB 

^ 7 ldB - 2B _ 1 (-W) 



is the quantizer resolution (also known at the ST side, e.g., selected by the system standard), 
with 7 and 7 being the maximum and minimum expected values for 7fc[n], respectively. The BS 
sends on a broadcast downlink channel the integer b k [n], which is used by the kth ST to retrieve 
the quantized version of %[n] using (1281) . 

Replacing 7fc[n] with jJL k {n\ into (|26|) and substituting the result into (|25l> . we eventually obtain 



p fc n + 1 = arg max — (31) 

Recalling (1281) . it follows that its evaluation at the ST side requires only the knowledge of b k [n). 

Collecting all the above facts together leads to the energy-efficient synchronization algorithm 
illustrated in Algorithm [Q which allows each ST to operate in a complete distributed manner 
without any knowledge of other users' power allocation strategies (as if in a single-user sce- 
nario). Observe that Algorithm [T| is reminiscent of the best-response synchronization algorithm 
(BRSA) illustrated in ifTOl . except for the discrete action sets and the limited feedback from 
the BS, which makes it more suited for a practical implementation. In the sequel, we call the 
iterative procedure described in Algorithm Q] as discrete and limited feedback best response 
synchronization algorithm (DLF-BRSA). 

Remark 1: It is worth observing that, in their most basic forms, iterative algorithms based on 
best-response dynamics require a significant amount of information to be locally available at the 
player (transmitter) [28]. For example, they usually require knowledge of the number of players 
and of the actions played by all the other players. To overcome this problem, other algorithms 
based on reinforcement learning techniques have been adopted in the literature [|2~8l . The main 



March 13, 2013 



DRAFT 



1 1 



Algorithm 1 Discrete and limited feedback best response synchronization algorithm (DLF- 

BRSA) 

a. Initialization: each ST k — 1, 2, . . . , K 

al) initializes the transmit power p k [0] to the lowest power value p; 
a2) sets n = 0. 

b. Iterative algorithm: at each step n, each ST k 

bl) receives on a common downlink channel the result of the GLRT © for code c k and the 

integer b k [n] computed through (1271) - (|30T ); 
b2) if the GLRT for code c k is verified and n k [n] > 7 req , exits the algorithm (i.e., ST k is 

successfully associated to the BS), otherwise goes to the next step; 
b3) adjusts its transmit power according to (ED; 
b4) updates n — n + 1. 



advantage of these solutions is that they do require each player to know only its corresponding 
utility. Although based on best-response dynamics, DLF-BRSA possesses most of the advantages 
shown by other reinforcement learning-based algorithms, as it allows each ST to operate in a 
distributed and iterative way requiring only knowledge of its own estimated SINR, as illustrated 
in Algorithm [Q 

Remark 2: Most of the computational complexity of DLF-BRSA is represented by the ex- 
haustive search in (13TI) . which must be performed at each iteration step n over the Q discrete 
power levels of the set A k . In those applications characterized by large values of Q, this may 
represent an implementation impairment. In such cases, one may resort to the supermodularity 
properties of the utility function u k (see Appendices [A] and |B]) and reduce the search complexity 
looking only at the values of p k G A k in the neighborhood of p k [n] . 

V. Simulation results 

Numerical simulations are now used to assess the performance of DLF-BRSA and to make 
comparisons with existing alternatives. As in Section ITlI-B L the numerical analysis is conducted 
by averaging over 20, 000 independent realizations of a network whose parameters are fixed 
as follows: T s = 89.28 ns, N = 1024, M = 4, V = 36, TT fa = 10~ 5 , and MSE e = 324, 
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which yield p = 128, A = 0.12, 7 req |dB = —6.19, and 7*|dB = 7|dB = 7.09. The minimum and 
maximum expected values of %[n] in (|3Q|) . based on an extensive simulation campaign, are fixed 
to 7|dB = — 8 and 7~| d B = +16, whereas the appropriate number of bits B is chosen later on the 
basis of the following numerical analysis. Without loss of generality, we concentrate on the first 
ST (i.e., k = 1) and assess the performance of the investigated solutions when its distance d\ is 
kept constant. All other STs are assumed to be randomly located in [-R/10, R] with R being the 
cell radius. The normalized power constraints are fixed to p/cr^|dB = ~ 20 and p/cr^|dB = +30, 
and the same power initialization Pk[0] = p is used for all STs k E JC, which also use a common 
power quantization step A| dB = 1. 

Fig. [3] reports the average normalized power expenditure p avg /cr^ (in dB) required by DLF- 
BRSA for successfully completing the synchronization procedure. The numerical results are 
plotted as functions of K for B = {1,2,3,8}. The results obtained with DLF-BRSA when 
B — > oo (i.e., with continuous-SINR feedback) are used as a benchmark. Comparisons are also 
made with the BRSA illustrated in |[T0l in which the action sets are continuous and perfect 
knowledge of the estimated SINRs is available at the STs. The results of Fig. |3] indicate that 
the quantization of the SINRs has only a marginal effect on the performance of DLF-BRSA. In 
fact, it has practically the same performance for 5 = 3,8 and B — > oo, whereas a significant 
degradation is observed only for B = 1. We argue that the quantization of the estimated SINRs 
marginally impacts the system performance since it is basically perceived at the STs as an 
additional estimation error introduced by the BS (which on the other hand can actually exploit 
real-valued estimation methods). Based on the above results, in all subsequent simulations we 
set B = 3. From (l30l) . recalling that 7| dB = —8 and 7~|dB = +16, we have A 7 | d B = 3.43. 

To evaluate the impact of the discretization of the action sets, we now compare the performance 
of BRSA with those of DLF-BRSA. From Fig. [3l it follows that they do perform identically when 
the DLF-BRSA uses B > 3. This seems to contradict the numerical results of Fig. |2] discussed 
at the end of Section IIII-Bl which show that discretizing the set of strategies is beneficial for 
individual (and, consequently, global) performance. On the basis of the analysis of Section ITlI-B I 
the DLF-BRSA is expected to outperform the BRSA. The motivation behind this contradictory 
result can be understood by recalling that the considered ST takes part in the synchronization 
procedure as long as it is not correctly detected by the BS. As a consequence, what really 
impacts on the performance of DLF-BRSA and BRSA is its power evolution from the time it 
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enters the network (n = 0) to the time step n exit in which the exit conditions (detailed in Step 
b2 of DLF-BRSA) are satisfied. 

To this aim, Fig. |4] reports the average number of iterations n™ 1 as a function of K, which 
turns out to be the same for the both the BRSA and the DLF-BRSA (with B > 3) with 
good approximation. Interestingly, numerical simulations confirm that, when < n < n exit , the 
difference between the power updates across the two schemes is negligible. On the contrary, the 
performance measured in Fig. |2] corresponds to that achieved by the BRSA and the DLF-BRSA 
schemes without the exit conditions listed in Step b2 (as the GNE, computed in Section IIII-BI 
through an exhaustive search [ 19 ], can also be achieved using the best-response dynamics 
described in Section ITVl) . whose convergence time is usually much higher than n exit . This is 
the reason why the performance in terms of total energy expenditure reported in Fig. |3]is similar 
in the two cases. On the basis of the above results, we can conclude that DFL-BRSA yields 
the same performance of BRSA. However, this is achieved while i) reducing the complexity 
of the user terminals (thanks to the discretization of the power amplifier), and ii) requiring a 
limited amount of feedback from the BS (thanks to the finite number of bits B used to send the 
estimated SINRs). 

The performance of DLF-BRSA are now compared with those achieved by two alternative 
solutions based on a deterministic increase of the transmit power: the deterministic synchro- 
nization algorithm (DSA), in which the update rule is p k [n + l]| dB = Pfc[n]|dB + A| dB ; and the 
binary exponential backoff DSA (BEB-DSA), in which p k [n + n e ]|dB = PfcNIdB + A| dB , where 
n e is an exponentially-distributed backoff counter (see [flOl for more details). In all subsequent 
simulations, we assume K = 5 and set A| dB = 1 and Pk[0] — P for k — 1, 2, . . . , K. 

Fig. [5] shows Pavg/^n f° r a ^ investigated solutions as a function of the normalized distance 
d\/R, and Fig. [6] illustrates the average time T avg needed to complete the synchronization 
procedure. In particular, T avg is computed as T avg = Tj ■ n e ^, where Tf = 5 ms accounts 
for the time interval (frame time) between two successive synchronization attempts. In addition, 
Fig. U\ shows the MSE of the timing estimate Q\ for different values of d\jR. 

From the results of Fig. |5] it follows that DLF-BRSA provides roughly the same power 
consumption of BEB-DSA, which is significantly lower than that needed by DSA. However, 
the results of Fig. |6] show that the time required by DLF-BRSA to achieve synchronization 
is much shorter than that needed by DSA and BEB-DSA, especially when d\/R increases. In 
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addition, Fig. [7] shows that the estimation accuracy with DLF-BRSA is higher than that with both 
DSA and BEB-DSA. Collecting all the above facts together, we may conclude that DLF-BRSA 
provides better results in terms of energy efficiency and parameter estimation accuracy, also 
providing some performance that slightly depends on the transmitter-receiver distance. This is 
achieved at the price of a slight increase of information to be fed back over the control channel. 
In particular, the amount of information to be exchanged during each frame and for each q E C 
is the following: 1 bit to broadcast the outcome of the GLRT, and B = 3 bits to transmit the 
quantized SINRs. This means that a total of A\C\ bits per frame time Tf is required by DLF- 
BRSA, which corresponds to a feedback rate on the order of a few tens of kb/s, given that \C\ 
is usually on the order of tens to hundreds (e.g., see [0, []3]). 

VI. Conclusion 

In this work, we have formalized the power allocation problem for energy-efficient contention- 
based synchronization in OFDMA-based networks as a finite constrained noncooperative game. 
The generalized Nash equilibria have been analytically studied, and numerically evaluated. The 
above results have been used to derive a distributed and iterative energy-efficient power control 
algorithm with discrete powers and limited feedback. The performance of the above solution have 
been evaluated and compared with alternatives by means of numerical simulations. Using realistic 
system parameters and widely agreed-upon channel models, we have shown that the proposed 
solution incurs only a negligible degradation with respect to the scheme illustrated in [fTOll . while 
a significant gain is achieved with respect to deterministic -based power allocation approaches 
(both with and without contention resolution methods). The derived technique requires a feedback 
on the downlink on the order of a few tens of kb/s, which can be easily accommodated in current 
IEEE 802.16m and LTE @ standards. Since the proposed solution shows a (much) faster 
synchronization time than deterministic methods, it can be used to further increase the energy 
efficiency of mobile terminals by reducing the frequency of periodic ranging procedures, that 
are currently used by 4G communication systems to meet the transmission latency requirements. 
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Appendix A 
Proof of Theorem [j] 

Fig. [8] shows a pictorial representation of the typical shape of the utility u k (p) as a function 
of the power p k = 7r£ for a fixed interference p\& (all quantities are on a logarithmic scale, 
although the subscript 'dB' is suppressed for the sake of presentation). The relevant points of the 
utility function in terms of SINR, scaled by the quantity z^(p\fc)> are also reported: in addition 
to 7 req and 7, defined in (fl4l) and (T2TI) . respectively, Fig. [8] also shows the inflection point 7, 
7 < 7 < 7*, such that nd,A:(7fc) is strictly convex for j k < 7, and strictly concave for j k > 7. 
Although Fig. [8] depicts the case 7 rcq < 7, the considerations drawn in the following apply in the 
case 7 rcq > 7 as well. Circular markers report u k ([7r k \ P\fe]) for g = 1, . . . , Q (in this example, 
Q = 8). Note that the best-response map defined in ([191 ) may yield r k (p\ k ) < 7*/Vfc(P\fc)> as 
occurs in this example. 

A GNE in the game Q exists provided that the K sets Ak(p\k) Q V k , k G /C, are nonempty, 
which translates, using (fT6l) . into ensuring that there exists at least a power level nf 1 G V k such 
that 7r£ > 7req/' / fc(p\A:) for all fc. Since 7 rcq < 7* by hypothesis, it is sufficient to show that 
7* I ' u k(P\ k ) < r fc(p* fe ) = Pi for all G /C. By following the same steps as in ifTTI . we can derive 
the sufficient condition (|22l) . which becomes also necessary in the case 7* = 7 req . Note that, 
unlike |fTTfl. here we cannot derive a necessary condition that holds for any 7 > 7 rcq , because of 
the inequality 7*/Vfc(p\fc) ^ r fc(P\fc) that is originated from using a finite set, and hence GNE 
might exist even though (l22l) is not fulfilled. This is also in accordance to what highlighted in 
(TT9l) . as an equilibrium can exist even if p* k < 7*/i/ fc (p* fc ), provided that 7* > 7 req . 

To proceed further with the proof of existence, it is useful to introduce the following definition: 

Definition 1 (KTTti): A best response r k (p\ k ) possesses the ascending property if r k (p\ k ) < 
r fc(p'\fc) for all k G JC when p\ k 7^ pi k is such that p e < p' e \f£ 7^ k. ■ 

To show that the best response <TT~9T > is ascending, let us define r k = r k (p\ k ), r' k = r k (p[ k ), 
v k = Vk(p\k), and v' k = v k (p'\ k ) for notational convenience. Note that assuming p\ k ^ p'^ k with 
Pe < Pe ^ 7^ k implies v k > v' k . Let us consider two different cases: 

a. r k < 7*/V fc : in this domain, u k ([p k ,p'y k }) is an increasing function of p k . Hence, power 
vector p'^'s best response is r' k = argmax Pfe u k (\p k , p'^]) > r k . 
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b. r k > j*/u k : in this domain, since v k > v' k , both u k (\p k ,p\ k }) and u k (\p k ,p{ k }) are de- 
creasing functions of p k . Hence, for r k to be a best response, the condition u k ([r k , Pu]) > 
u k ([r k /A,p\ k ]) must hold, with r k /A < ^ jv k . Due to the asymmetry of u k ([p k ,p\ k ]) 
with respect to the point of maximum 7*/i>%, r k /A < p k , where p k , 2^*/u k — r k < 
Pk < 7*M < 7*/H> is the power level such that u k ([p k ,p\ k ]) = u k ([r k , p\ k ]). Since 
Uk(\pk,P\ k ]) is increasing in the region p k < ~f*/v' k , u k (\p k , p\ k ]) < u k ([p' k ,p\ k }), where 
p' k > p k is the counterpart power on u k ([p k ,p'^ k ]) such that u k (\p' k ,p[ k )) = u k ([r k ,p{ k ]), 
where the inequality p' k > p k follows from the fact the maximum of u k ([p k , p( J) is placed 
at Y/K > 7*M- Hence, u k ([r k /A, p\ k ]) < u k ([p' k , p'^ k }) = u k ([r k ,p'^ k }), which implies 
r k = r k- 

As a conclusion, r k (p\ k ) is an ascending best response function. In particular, if (|22)) is 
satisfied, this implies that r k (p\ k ) > r k (p\ k ) for all k and for all p such that p k > p k , where 
the vector p = [pi, . . . ,Pk] T is the minimum component- wise power allocation such that 7 fc = 
u k(p\k)Pk > 7 for all p k , with 7 < 7 defined above. In other words, the equilibrium points of 
Q (if any) are equal to the equilibria of a modified (generalized) game, which differs from Q as 
now the strategy spaces are subset of V k such that any vector allocation p is such that p k > p k , 
and hence 7^ > 7. To conclude the proof, let us introduce the following definition: 

Definition 2 (/UH/. /l29l/): A game is supermodular if u k (p) has increasing differences in 

P = [PfciP\fc]» i-e-, if 

Mfc(p) - Mfc([p'fe, p\ fe ]) > Ufc([p fc , p'\ fc ]) - Mfc(p') (32) 

for all p and p' such that, for all k, p k < p' k , and pi < p' e for all £ ^ k. ■ 
To prove that the utility u k (p), which is twice differentiable, satisfies (1321) . we can show, 
using ||29ll , that possesses the necessary and sufficient condition d 2 u k (p)/dpedp k > for any 
two components p e ^ p k . Using (fTVl) . we can easily derive 

d 2 u k (p) = _ n^( 7fc ) 7 l ~ Vilk) > Q 

dpidp k Va k p\ 
where the inequality follows from the fact that I\.'^ k {^ k ) = d 2 Yl^ k (^j k ) / 'd^ k < for 7 < 7^ < 7, 

and /(7 fe ) = 7*. • dIi Atk {j k ) / d^ k - n d;fc (7 fc ) > for 7 fc < 7 (see OH, Ell for further details). 

Proving the property of supermodularity concludes the proof, as supermodular games admit 

pure-strategy equilibria iTPl . As a consequence, the original game Q has pure-strategy GNE, 

under the sufficient condition (1221) . 
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Appendix B 
Proof of Theorem [2] 

To show that the GNE of the game Q is not necessarily unique, we use a counterexample. Let 
us focus on one GNE p*, whose existence is ensured by Theorem [TJ and let us suppose that p* 
is such that pi < j*/(i/ k y/~A), where v' k = v k (p'\ k ) is obtained using the vector p'^ fc = Ap* such 
that all components are scaled by the quantization step A, i.e., p' k = Ap* k . Under this hypothesis, 
p' is also a GNE of Q, i.e., r k {p'\ k ) = p' k VA; G /C. 

To show this property, let us note first that v k < v' k < v k j A, where u k = u k (pt k ). Hence, 
if Pi < 7*/( z/ *A) is the best response p k = r k (p* k ), it implies that p' k = Ap k < 7*/^ is also 
Pk = r fc(P\fc)' as Wk - 7*MI < \Pk ~ 1*1 u k\ and u k(\Pk, PyJ) is an increasing function of p k . 
If j'/(u k A) < p* k < 7 / /(z/£v / A), then 7'/^ < p k < j*VA/v k , i.e., p' k = Apl is greater than 
the point of maximum 7' '/u k . Furthermore, \p' k — 7*/Vfel < IK — TV^fel- ^ ue to the asymmetry 
of Ufe([p fe ,py), this ensures that u k ([p' k , p'^ k }) > u k (\p* k ,p'^ k ]), and thus p' k = r k (p[). As a 
conclusion, p' = Ap* is also a GNE of Q, and this concludes the proof. 

Note that, when K > 1 (e.g., K = [1 + V/j*\), it is often the case that J2e^k a ^Pt ^ 
Vfc. Hence, v' k > v k / A, and the condition p£ < 7*/(z/^,v / A) occurs frequently. This is the reason 
why the number of GNE increases as K increases. However, such condition is not necessary, 
and other GNE might exist, e.g., vectors p' in which some elements are p' k = Ap* k and some 
others are p' k = p k . 

Appendix C 
Proof of Theorem [3] 

By using the relation CD2), u&(Pa) = u k([PA, k , P A ,yJ) - u k{\Pk, P A ,\ fc D for a11 k and for a11 
p k G TV In particular, u fc ([p Aife , P AA J) > Wfc(K,P A ,\ fe ]) for an y Pfc in an y P* G f *> P* ^ Pa- 
Note also that 7^ = ffc(p A yjK — u k(P\ k )p k = l k for all fc, and 7^ > 7^ for some fc, under the 
hypothesis p Ak < p* k for all k, p A 7^ p*. As a consequence, itfc([Pfc, P A ,\ fe D = ^-&Al'k) I Pk > 
n<i,fe(7fc)/Pfc = Mfc(p*)- Since «fc(p A ) > ^ fc (p*) for all G /C, with strict inequality for some fc, 
([24]) follows. 
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Fig. 3. Average power consumption as a function of the number of STs. 



BRSA 




'2 3 4 5 6 7 8 



K 

Fig. 4. Average number of iterations as a function of the number of STs. 
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Fig. 5. Average power consumption as a function of the normalized distance. 
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Fig. 6. Average synchronization time as a function of the normalized distance. 
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Fig. 7. Average timing MSE as a function of the normalized distance. 




Fig. 8. Utility as a function of the transmit power for a fixed interference p\ fe . 
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